Automating Mendelian randomization through machine learning to construct a putative causal map of the human phenome

نویسندگان

  • Gibran Hemani
  • Jack Bowden
  • Philip Haycock
  • Jie Zheng
  • Oliver Davis
  • Peter Flach
  • Tom Gaunt
  • George Davey Smith
چکیده

A major application for genome-wide association studies (GWAS) has been the emerging field of causal inference using Mendelian randomization (MR), where the causal effect between a pair of traits can be estimated using only summary level data. MR depends on SNPs exhibiting vertical pleiotropy, where the SNP influences an outcome phenotype only through an exposure phenotype. Issues arise when this assumption is violated due to SNPs exhibiting horizontal pleiotropy. We demonstrate that across a range of pleiotropy models, instrument selection will be increasingly liable to selecting invalid instruments as GWAS sample sizes continue to grow. Methods have been developed in an attempt to protect MR from different patterns of horizontal pleiotropy, and here we have designed a mixture-of-experts machine learning framework (MR-MoE 1.0) that predicts the most appropriate model to use for any specific causal analysis, improving on both power and false discovery rates. Using the approach, we systematically estimated the causal effects amongst 2407 phenotypes. Almost 90% of causal estimates indicated some level of horizontal pleiotropy. The causal estimates are organised into a publicly available graph database (http://eve.mrbase.org), and we use it here to highlight the numerous challenges that remain in automated causal inference.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MeRP: a high-throughput pipeline for Mendelian randomization analysis

We present a Mendelian randomization (MR) pipeline (MeRP) to facilitate rapid, causal inference analysis through automating key steps in developing and analyzing genetic instruments obtained from publicly available data. Our tool uses the National Human Genome Research Institute catalog of associations to generate instrumental variable trait files and provides methods for filtering of potential...

متن کامل

Genetic Analysis of Venous Thromboembolism in UK Biobank Identifies the ZFPM2 Locus and Implicates Obesity as a Causal Risk Factor.

BACKGROUND UK Biobank is the world's largest repository for phenotypic and genotypic information for individuals of European ancestry. Here, we leverage UK Biobank to understand the inherited basis for venous thromboembolism (VTE), a leading cause of cardiovascular mortality. METHODS AND RESULTS We identified 3290 VTE cases and 116 868 controls through billing code-based phenotyping. We perfo...

متن کامل

Authors’ response to Hartwig and Davies

1. Davey Smith G, Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol 2003;32:1–22. 2. Burgess S, Timpson NJ, Ebrahim S, Davey Smith G. Mendelian randomization: where are we now and where are we going? Int J Epidemiol 2015;44:379–88. 3. Haycock PC, Burgess S, Wade KH, Bowden J, Relton C, Davey Smith G....

متن کامل

Network Mendelian Randomization Study Design to Assess Factors Mediating the Causal Link Between Telomere Length and Heart Disease.

Mendelian randomization study designs represent new powerful tools available to researchers that enable causal inferences to be made about the effects of risk factors in health and disease outcomes in the context of a prospective observational study. These study designs involve estimating the association between a genetically modifiable risk factor and health and disease outcomes. If individual...

متن کامل

Mendelian randomization: how it can--and cannot--help confirm causal relations between nutrition and cancer.

Observational epidemiologic studies of nutrition and cancer have faced formidable methodologic obstacles, including dietary measurement error and confounding. We consider whether Mendelian randomization can help surmount these obstacles. The Mendelian randomization strategy, building on both the accuracy of genotyping and the random assortment of alleles at meiosis, involves searching for an as...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017